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ABSTRACT 

The additional G_ 1 nucleotide on tRNA Hls is a nearly 
universal feature that specifies tRNA Hls identity in all 
three domains of life. In eukaryotes, the G_ 1 identity 
element is obtained by a post-transcriptional 
pathway, through the unusual 3-5 polymerase 
activity of the highly conserved tRNA Hls guanylyl- 
transferase (Thg1) enzyme, and no examples of 
eukaryotic histidyl-tRNAs that lack this essential 
element have been identified. Here we report that 
the eukaryote Acanthamoeba castellanii lacks the 
G_i identity element on its tRNA Hls , consistent with 
the lack of a gene encoding a bona fide Thg1 
ortholog in the A. castellanii genome. Moreover, 
the cytosolic histidyl-tRNA synthetase in A. 
castellanii exhibits an unusual tRNA substrate spe- 
cificity, efficiently aminoacylating tRNA Hls regard- 
less of the presence of G_ 1 . A. castellanii does 
contain two Thg1 -related genes (encoding Thg1- 
like proteins, TLPs), but the biochemical properties 
we associate here with these proteins are consist- 
ent with a function for these TLPs in separate 
pathways unrelated to tRNA Hls metabolism, such 
as mitochondrial tRNA repair during 5 -editing. 

INTRODUCTION 

Transfer RNA species are extensively processed before 
their use in translation. These processing events include 
various types of RNA editing and modification reactions 
that alter the sequence or chemical structure of the nucleo- 
tide bases from that originally encoded in the genome 
(1-4). Enzymes that carry out tRNA editing and modifi- 
cation are found in all three domains of life, and many 
comprise highly conserved enzyme families that catalyse 



similar reactions with tRNAs from multiple organisms. 
One of the most highly conserved tRNA modifications is 
the presence of an additional G residue (G_i) at the 5' end 
of tRNA Hls , which is a unique distinguishing feature of 
tRNA Hls in all three domains of life. Only histidine 
tRNAs have the extra G nucleotide, and the presence of 
G_i is required for efficient recognition of the tRNA for 
aminoacylation by the histidyl-tRNA synthetase (HisRS) 
(5-8). 

Although the G_i residue is a ubiquitous feature, it is 
obtained by different mechanisms depending on domain 
of life. In bacteria (and many archaeons), G_i is encoded 
in the 5'-leader sequence of the tRNA Hls gene and 
transcribed as part of the pre-tRNA, following which an 
alternative ribonuclease P (RNase P)-catalysed 5'-process- 
ing event results in retention of G_i in mature tRNA Hls 
(9). The presence of an encoded G_i is nearly 100% 
conserved throughout all sequenced bacterial genomes: 
only one small group of ~20 a-proteobacteria has a 
tRNA Hls that does not contain a G_i residue (10). In 
eukaryotes, however, the tRNA Hls gene does not have 
an encoded G_i, and the identity element is added post- 
transcriptionally to the RNase P-processed tRNA Hls by 
the action of the tRNA Hls guanylyltransferase (Thgl) 
(Figure 1A) (11,12). First identified in Saccharomyces 
cerevisiae, Thgl is a member of a superfamily of reverse 
(3—5') polymerase enzymes, and in eukaryotes, Thgl uses 
this 3 —5' nucleotide addition activity to add the single 
non-templated G_i residue, opposite A 73 , to the 5'-end 
of tRNA His (16,17). Until now, there have been no 
examples of eukaryotic tRNA Hls species that lack this im- 
portant recognition element. 

Although addition of the G_j element to tRNA Hls was 
the first activity associated with the Thgl superfamily, sub- 
sequent biochemical characterization of other Thgl- 
related enzymes revealed a functionally distinct group of 
enzymes known as Thgl-like proteins (TLPs). TLPs, which 
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Figure 1. Multiple roles for 3'— 5' addition catalysed by Thgl family enzymes. (A) Participation of Thgl enzymes in tRNA Hls maturation in 
eukaryotes. In this reaction, which is highly selective for tRNA Hls , Thgl adds a single non-templated G_] residue to the 5'-end of the tRNA 
following processing of the pre-tRNA by RNase P (11,12). This essential function of Thgl in eukaryotes serves to add the required G_j identity 
element for recognition by the HisRS. (B) Predicted participation of TLPs in mt-tRNA 5'-editing in protozoa. The process is a two-step reaction 
catalysed by an unidentified nuclease to remove mismatches from the 5'-end of the encoded tRNA, followed by repair of the 5'-truncated tRNA 
using 3'— 5' polymerase activity to restore a fully base-paired aminoacyl-acceptor stem (13). The 5'-tRNA repair reaction is efficiently catalysed by 
TLP members of the Thgl superfamily (14,15). 



are abundant in the domain Archaea, and more sporadic- 
ally distributed throughout Bacteria and Eucarya, do not 
efficiently add a non-templated G_i nucleotide to 
tRNA Hls , nor do they exhibit the rigorous specificity 
for tRNA Hls associated with eukaryal Thgl enzymes 
(14,17-20). Instead, these enzymes selectively catalyse 
template-dependent 3'— 5' polymerization and exhibit a pro- 
ficient tRNA 5'-end repair activity, suggesting alternative 
roles for these enzymes in vivo (14,15). Although the precise 
physiological role for TLPs in Bacteria and Archaea 
remains unknown, two TLP gene products from the 
eukaryotic slime mold Dictyostelium discoideum were 
recently implicated in a mitochondrial tRNA (mt-tRNA) 
5'-editing reaction (Figure IB), based on their ability to 
catalyse repair of 5'-truncated mt-tRNAs (15). 

Consistent with the distinct biochemical properties 
associated with Thgl and TLP enzymes investigated so 
far, a phylogenetic analysis of Thgl and TLP gene 
sequences reveals that these genes segregate into distinct 
clades that can also be distinguished on the basis of 
conserved sequence motifs (17,21). The Thgl clade 
contains genes closely related to S. cerevisiae THG1 and 
the presence of a Thgl gene is widespread in eukaryotes, 
consistent with the critical importance of the G_i identity 
element and the highly conserved mechanism of its 
post-transcriptional addition in all eukaryotes studied 



to date. As the vast majority of studied eukaryotes are 
not predicted to contain mt-tRNAs that require 5' editing 
(and therefore do not encode any TLP genes), most eukary- 
otes encode only a single Thgl gene that clusters as an 
ortholog to other bona fide Thgl enzymes. However, a 
deeper investigation of the Thgl family tree reveals 
several eukaryotes that lack any identifiable 
Thgl homolog, and thus are notable exceptions to this 
general rule (12,17). These organisms can be classified 
further into two groups. The first group contains those 
that lack any identifiable Thgl or TLP gene, a group that 
consists primarily of protists (eukaryotic microbes), 
including diatoms and several types of algae. The basis 
for the apparent lack of any Thgl -related gene in these 
species has not been determined. A second group consists 
of organisms that lack an identifiable Thgl gene, but 
encode one or more TLPs. This group includes several 
fungi (Phaeosphaeria nodorum and Spizellomyces 
punctatus) as well as the amoebozoan Acanthamoeba 
castellanii (17). In the case of A. castellanii, we have 
previously emphasized that the lack of an identifiable 
Thgl gene in this organism is unlikely to be due to 
incomplete genome data, given the extensive genomic and 
transcriptomic coverage in the genome project for this 
organism (http://www.hgsc.bcm.tmc.edu/projects/micro- 
bial/microbial-pubPreview.xsp?project_id = 163). 
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We chose to investigate the two TLP genes encoded in 
the genome of A. castellanii (designated AcaTLPl and 
AcaTLP2) to answer several questions related to the bio- 
logical function(s) of these genes and the apparently 
non-canonical requirements for tRNA Hls maturation in 
this species. Because, like all Thgl/TLP enzymes studied 
so far, the AcaTLPs contain all of the functional sequence 
motifs associated with the ability to catalyse 3—5' nucleo- 
tide addition (Supplementary Figure SI) (16,22,23), one 
obvious possibility was that, despite the inability of all 
previously characterized TLP enzymes to efficiently 
catalyse G_i addition to cytosolic tRNA Hls , one or both 
of the AcaTLPs might exhibit this activity, and thus 
maintain a Thgl family-dependent pathway for tRNA Hls 
maturation. Our data do not support this hypothesis, as 
here we show that neither of the AcaTLPs catalyses G_i 
addition to cytosolic tRNA Hls at significant levels. 
Instead, consistent with the lack of a predicted Thgl 
gene in A. castellanii, cloning and sequencing of the cyto- 
solic A. castellanii tRNA Hls reveals that this tRNA lacks 
the characteristic G_i element. Remarkably, unlike any 
other previously characterized eukaryotic HisRS, we also 
demonstrate that purified cytosolic HisRS from 
A. castellanii does not depend on the G_] identity 
element for aminoacylation of cytosolic tRNA Hls . Thus, 
we have identified the first example of a eukaryotic 
organism that breaks the rule regarding presence of the 
G_i element for specifying tRNA Hls identity. These 
findings are the first demonstration of absence of the uni- 
versal G_i identity element in eukaryotic tRNA Hls , and 
open the door to investigation of other non-canonical 
mechanisms for specifying tRNA Hls identity in 
eukaryotes. 

MATERIALS AND METHODS 

Cloning and purification of A. castellanii 
TLPs and HisRS 

A. castellanii cDNA library was synthesized from ~3 ug 
A. castellanii Neff total RNA (generously provided by 
Dr R. Gawryluk, Dalhousie University) using 2.5 uM 
oligo(dT) 2 o primer and Superscript III reverse tran- 
scriptase (RT) (Life Technologies, Grand Island, NY) ac- 
cording to the manufacturer's protocol. The resulting 
cDNA was treated with RNase H and RNase A, 
purified by phenohchloroform extraction and used as the 
template for PCR with AcaTLPl, AcaTLP2 and 
AcaHisRS gene-specific primers (Supplementary Figure 
S2), using iProof DNA polymerase (BioRad, Hercules, 
CA). Gene, coding and protein sequences were assembled 
from genomic and EST data publicly available in 
GenBank, supplemented by unpublished RNAseq data 
kindly provided by Prof. Brendan Loftus (Conway 
Institute, Dublin) and Prof. Andrew Roger (Dalhousie 
University) (see Supplementary Figure S2). Gel-purified 
PCR products were cloned by ligation-independent 
cloning into a previously described vector (AVA421) 
that contains sequences for expression of an iV-terminal 
His 6 -tag for affinity purification of the overexpressed 
protein (24). 



Available EST data for AcaTLPl support the presence 
of a short poly-serine stretch at the extreme C-terminus of 
AcaTLPl (Supplementary Figure SI). However, as the 
exact number of serine residues in this sequence is 
somewhat ambiguous (owing to differences between the 
genomic sequence and a few ESTs), and because the 
residues at the C-terminal end are not highly conserved 
and not essential for G_i addition activity with human 
Thgl (16), we did not include these 7 terminal serine 
residues, or the preceding threonine, in the final clones. 

Sequence-verified clones [GenBank accession numbers 
JX547005 (AcaTLPl), JX547006 (AcaTLP2) 
and JX547007 (AcaHisRS)] were transformed into 
Rosetta(pLysS) (AcaTLPs) or BL21(DE3)pLysS 
(AcaHisRS) Escherichia coli strains and induction of 
protein expression and purification of proteins was per- 
formed as described, except that induction was performed 
at 37°C for 6h for the AcaTLPs and 18°C overnight for 
the HisRS proteins (15). All resulting proteins were 
dialysed into buffer containing 50% glycerol, 20 mM 
Tris, 4 mM MgCl 2 , 1 uM ethylenediamine tetraacetic 
acid (EDTA), 1 mM dithiothreitol, pH 7.5 for storage at 
— 20°C. AcaTLP dialysis buffer additionally contained 
0.5 M NaCl, whereas AcaHisRS dialysis buffer also con- 
tained 0.05 M NaCl. Protein concentrations were 
determined using the BioRad protein assay, and recom- 
binant proteins were judged to be >90% pure by sodium 
dodecyl sulphate-polyacrylamide gel electrophoresis 
analysis of purified proteins. 

G_x addition/5' -end repair activity assays 

A previously described phosphatase protection assay was 
performed to determine the biochemical activity of the 
purified AcaTLPs (25). [5'- 32 P]tRNAs (10-50 nM) were 
incubated with purified AcaTLPs (or SceThgl, bacterial 
TLP and/or DdiTLPs in control assays) in Thgl reaction 
buffer [25 mM 4-(2-hydroxyethyl)-l-piperazineethanes- 
ulfonic acid, 0.2mg/ml bovine serum albumin, 3mM 
dithiothreitol, pH 7.5], additionally containing 10 mM 
MgCl 2 and 125 mM NaCl (standard SceThgl assay con- 
ditions), or 10 mM MnCl 2 and 50 mM NaCl (as indicated 
for each figure) for 1-2 h at room temperature. Reactions 
were quenched by addition of EDTA, followed by diges- 
tion with RNase A to generate labeled oligonucleotide 
reaction products. After treatment of the digested 
reaction products with alkaline phosphatase, unreacted 
substrate tRNAs are visualized as inorganic phosphate, 
and 5'-nucleotide addition products are visualized as in- 
ternally labeled oligonucleotides (as indicated in figure 
legends for each assay). Reaction products were resolved 
using silica thin-layer chromatography (TLC) plates in a 
55:35:10 (v:v:v, 1-propanol: NH 4 OH: H 2 0) solvent 
system. 

Determination of steady-state kinetic parameters 

Steady-state kinetic assays were performed with 
A. castellanii mt-tRNA Leu AG +2 substrate and purified 
AcaTLPl or AcaTLP2-mito, as previously described 
(22). Linear initial rates were determined from time 
courses of assays containing 0.1-25 uM tRNA and 
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1-60 nM purified enzyme, maintaining a minimum excess 
of at least 5-fold substrate over enzyme in each reaction. 
Reactions were performed in standard Thgl reaction 
buffer and time courses were selected to yield <10% con- 
version to product over the course of the measured 
reaction. Initial rates (normalized for [TLP]) were 
derived from at least three independent assays, plotted 
as a function of [tRNA] and fitted to the Michaelis- 
Menten equation for determination of steady-state 
kinetic parameters. 

Determination of G_i status of A. castellanii 
cytosolic tRNA His 

Total RNA was isolated from A. castellanii cells (~10 6 
cells, generously provided by Dr Greg Booton, Ohio 
State University) using Tri Reagent (Sigma) according to 
the supplier's instructions. Purified RNA (5 10 ug) was 
circularized using T4 RNA ligase (USB) as described in 
(26), and circularized RNA (circRNA) was purified by 
phenolxhloroform extraction, followed by ethanol pre- 
cipitation using 5 ug glycogen. CircRNA (2 ug) was used 
as the template for cDNA synthesis with 2pmol 
AcaHisRT primer (5'-GAC CTGCAG ACCAATGTACT 
ACCAC-3'); the primer sequence includes a PstI restric- 
tion site (underlined) for the subsequent cloning steps. 
After heating together and cooling the primer and 
circRNA (95°C for 4min, followed by slow cooling to 
room temperature for 30min), cDNA synthesis was per- 
formed with Superscript III RT according to the manu- 
facturer's instructions, for 1 h at 50°C. cDNA reactions 
were treated with RNase A (0.5 mg/ml final concentration) 
to remove remaining RNA, and reactions were purified by 
phenolxhloroform extraction followed by ethanol precipi- 
tation. The purified cDNA was resuspended in 20 ul TE 
pH 8.0, and 3-5 ul of purified cDNA was used as the tem- 
plate for PCR with AcaHisRT and AcaHis3' (5'-GCAGG 
ATCCCAACCCGGGTTCAATTCCCG-3') primers. 
AcaHis3' primer also contains a BamHI restriction site 
(underlined) for downstream cloning. PCR reactions 
were performed with iProof polymerase (BioRad) accord- 
ing to the manufacturer's instructions. Reverse 
transcriptase-polymerase chain reaction (RT-PCR) 
products of the expected size were purified by electrophor- 
esis in 1% agarose gels, digested with BamHI and PstI 
(NEB) and subsequently ligated into a similarly digested 
pUC19 vector using O.lU/ul T4 DNA ligase (USB). 
Plasmid DNA was isolated from individual positive 
clones and sequenced. 

Histidylation assays 

CCA labeling of tRNAs with [a- 32 P]-ATP and 
histidylation assays were performed essentially according 
to reference (27), according to details provided in the 
Supplementary Data Methods. The 3'-labeled tRNA 
(labeled at the 5'-phosphate of A 76 ) was incubated with 
250 uM histidine, 4mM ATP and See or Aca cyto HisRS 
(2.5-0.1 uM; in 5-fold serial dilutions, as indicated), in 
aminoacylation buffer (27). PI -digested reactions were 
resolved by PEI-cellulose TLC using a glacial acetic 



acid: 1 M NH 4 C1: water (5:10:85, v:v:v) solvent system, 
and plates were visualized using a Typhoon trio imager. 

RESULTS 

A. castellanii TLP1 and TLP2 lack the ability to catalyse 
efficient G_i addition to cytosolic tRNA Hls in vitro 

We cloned both of the AcaTLP coding sequences into a 
vector previously used for expression of Thgl -family 
proteins in E. coli, and used these vectors for purification 
of the proteins according to previous protocols (18) 
(Supplementary Figures SI and S2). As AcaTLP2 is pre- 
dicted to contain a 44-amino acid TV-terminal mitochon- 
drial targeting signal, expression clones were generated 
that both contain and lack this TV-terminal sequence, 
designated AcaTLP2 and AcaTLP2-mito, respectively 
(Supplementary Figure SI). 

The three proteins (AcaTLP 1, AcaTLP2 and 
AcaTLP2-mito) were overexpressed in E. coli and 
purified using immobilized metal-ion affinity chromatog- 
raphy. The resulting purified proteins were tested for 
ability to add G_i to a cytosolic tRNA Hls substrate, 
which contains the A 73 discriminator nucleotide that is 
present in all eukaryotic cytosolic tRNA Hls species. 
Reactions with any of the purified AcaTLP proteins ex- 
hibited little detectable G_i addition product, with a 
maximum yield of 0.1% and 1% G_i addition product 
in reactions containing the highest concentration 
AcaTLP 1 and AcaTLP2, respectively, as compared with 
40% in the control reaction with S. cerevisiae Thgl 
(SceThgl) (Figure 2A). Although the tRNA His used for 
these assays was derived from 5. cerevisiae, not 
A. castellanii, and thus the possibility of minor kinetic 
differences between this tRNA and the native substrate 
cannot be excluded, the complete lack of observable G_i 
addition, coupled with the fact that the tRNA His GUG 
anticodon is the major determinant for recognition of 
the tRNA for G_! addition by yeast Thgl (25), indicate 
that the AcaTLPs are not efficient G_i addition enzymes. 

In these same assays, each of the AcaTLP enzymes also 
produced small, but detectable amounts of products 
whose migration is consistent with previously identified 
activated intermediates resulting from addition of a 5'-ad- 
enylate or guanylate to the 5'-end of the tRNA during the 
first step of 3—5' nucleotide addition (18). These products 
are evident in reactions with high concentrations of 
AcaTLPs, suggesting a similar block to nucleotide 
addition following the activation step of the reaction to 
that observed previously for other TLPs (Figure 2A). The 
presence of the mitochondrial targeting peptide on 
AcaTLP2 does not appear to substantially affect the 
pattern or amount of product produced. 

AcaTLPl and AcaTLP2 catalyse template-dependent 
3-5' polymerase activity 

A hallmark of other studied TLPs is their ability to effi- 
ciently catalyse template-dependent Watson-Crick 3—5' 
polymerase activity, but not non-templated 3—5' 
addition (18). This property explains the general inability 
of TLPs to catalyse G_j addition to wild-type cytosolic 
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Figure 2. AcaTLPs catalyse templated, but not non-templated, G_i addition to tRNA Hls . (A) G_, addition assay to wild-type yeast tRNA Hls . 
Phosphatase protection assays were used to determine whether either of the AcaTLPs catalyse addition of G_i to 5'- 32 P labeled tRNA; addition 
of G_i results in production of a labeled G_!p*GpC oligonucleotide. Reactions contained 5-fold serial dilutions of purified AcaTLPs (0.4mg/ml- 
3 ug/ml AcaTLPl and AcaTLP2, 1.2 mg/ml-10 ug/ml AcaTLP2-mito), or 1 mg/ml other purified TLP/Thgl enzymes as indicated; — , no enzyme 
control. Reactions shown were performed in Thgl assay buffer containing 125 mM NaCl, 10 mM MgCl 2 . (B) G_] addition assay with C 73 -tRNA Hls 
variant. Phosphatase protection assays were performed with serial dilutions of purified enzyme as in (A) (0.4mg/ml-0.6 ug/ml AcaTLPl and 
AcaTLP2, 1.2mg/ml-2 ug/ml AcaTLP2-mito); addition of G_i is indicated by G_]p*GpC reaction product, and additional polymerase products 
(due to subsequent addition of templated G_ 2 and G_ 3 nucleotides) are indicated. Reactions shown were performed in Thgl reaction buffer 
containing 50 mM NaCl, lOmM MnCl 2 . 



tRNA Hls in eukaryotes, as addition of G_i would result in 
formation of a non-canonical G_i-A 73 base pair (see 
Figure IB). However, when a tRNA Hls variant that 
contains a C 7 3-discriminator nucleotide is used as the sub- 
strate for G_i addition, TLPs from Bacteria, Archaea and 
D. discoideum efficiently add G_i (and sometimes multiple 
G-residues), reflecting their preference for Watson-Crick 
base pair formation (14,15,18). 

To determine whether AcaTLPs exhibit similar 
behavior, we tested activity of each enzyme with a C 73 - 
containing variant of cytosolic tRNA Hls , using a similar 
assay with 5'- 32 P-labeled tRNA. In the assays, AcaTLPl 
and AcaTLP2 both catalysed 3—5' addition of G_i as well 
as formation of additional products, which migrate ac- 
cording to the positions of 3—5' polymerase reaction 
products (corresponding to G_ 2 and G_ 3 additions). 
AcaTLP2 is apparently more active than AcaTLPl 
under these assay conditions, and again, the presence of 
the mitochondrial targeting signal in the purified protein 
has little discernible effect on activity (Figure 2B). The 
ability to efficiently add G_i to the C73-containing, but 
not A 73 -containing, tRNA His affirms that the AcaTLPs 
exhibit biochemical properties similar to other TLPs 
characterized so far, and thus are unlikely to participate 
in cytosolic tRNA Hls maturation in vivo in A. castellanii. 
Moreover, neither of the AcaTLPs is able to support 
growth of the S. cerevisiae thgl A strain when provided 
as the sole source of Thgl activity in the cell 



(Supplementary Figure S3), consistent with the results of 
the activity assays. This observation helps to rule out the 
possibility that a low level of non-templated G_i addition 
activity is exhibited by the enzymes, which is apparently 
sufficient to support tRNA Hls maturation in vivo as 
observed previously for one bacterial TLP (14,20). 

Cytosolic tRNA Hls in A. castellanii lacks the canonical 
G_i identity element 

As the A. castellanii genome does not encode a bona fide 
Thgl ortholog, and neither of the AcaTLPs appears to be 
able to replace this enzymatic activity, we considered two 
possibilities for the maturation of tRNA Hls in 
A. castellanii. First, it is possible that another enzyme in 
A. castellanii, unrelated to the Thgl family, catalyses 
this essential activity. Second, the tRNA Hls in 
A. castellanii might be unusual, lacking the G_i identity 
element that is the hallmark of all other studied eukaryotic 
tRNA Hls species (28). To distinguish between these 
possibilities, we determined the G_i status of the cytosolic 
tRNA Hls in A. castellanii. Total RNA was purified from 
A. castellanii cells and subjected to circularization using 
T4 RNA ligase, followed by RT-PCR to amplify the 
junction region formed between the mature 5'- and 
3'-ends of the tRNA, then cloning and sequencing of the 
resulting PCR products. Sequences were obtained for 
12 full-length clones, and for 11 of 12 sequences, the 
clones lacked a G_i nucleotide and instead resulted 
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of 12 total sequenced); arrows indicate the direction of the 5'- and 3'-end sequences corresponding to the sequence on the tRNA diagram. 



from ligation of the predicted 5' and 3'-ends without G_i 
addition (Figure 3). For the remaining sequenced clone, 
the RT-PCR product similarly lacked a G_i nucleotide, 
but an additional C-residue between the 5'- and 3'-ends 
(the origin of which is unclear) was evident from the 
sequence. Thus, the requirement for G_i addition for 
cytosolic tRNA Hls maturation in A. castellanii appears 
to be different from that of all other known eukaryotes, 
resulting in tRNA Hls that lacks this otherwise essential 
identity element. 



Cytosolic HisRS in A. castellanii is a non-canonical 
eukaryotic HisRS 

Previous studies with cytosolic HisRS from Bacteria and 
Eucarya have demonstrated a substantially decreased 
(~ 100-1 000-fold in catalytic efficiency) ability of these 
enzymes to aminoacylate tRNA Hls species that lack the 
G_i identity element (5,7,29). This strong dependence on 
G_i is similarly observed in vivo, and in S. cerevisiae, the 
major cause of the growth defect of the thglA strain has 
been attributed to loss of aminoacylated tRNA Hls as a 
consequence of loss of the G_i residue from the tRNA 
(30,31). To determine whether the lack of G_i on 
tRNA Hls is accommodated differently in A. castellanii, 
we cloned, expressed and purified the cytosolic A. 
castellanii HisRS (AcaHisRS) from the cDNA library, as 
described above for the AcaTLPs (see Supplementary 
Figure S2). The purified AcaHisRS was tested for its 
ability to aminoacylate two tRNA Hls substrates that are 
otherwise identical, differing only by the presence or 
absence of G_i. Aminoacylation assays were performed 
as previously described with substrates labeled with 32 P at 
the 5'-phosphate of A 76 ; following digestion of the reac- 
tions with nuclease PI, the ester-linked histidyl-A 76 
product is resolved from the unreacted 3'-terminal nucleo- 
tide by TLC (27). Control reactions with purified S. 
cerevisiae HisRS (SceHisRS) demonstrate the expected 
strict dependence on the presence of G_i, as only the 
G_i-containing tRNA Hls is aminoacylated to detectable 
levels in the in vitro assay (Figure 4). In contrast, the 
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Figure 4. The G_, nucleotide is not an identity element for AcaHisRS. 
Aminoacylation assays were performed with two yeast tRNA Hls tran- 
scripts, one lacking the G_[ residue (A) and one in which the G_| 
residue is included as a part of the in vitro transcript (G_i). Both 
tRNAs were 32 P-labeled at the 5'-phosphate of the terminal A 7 6 nu- 
cleotide according to previously described protocols. After incubation 
of the labeled tRNA with three dilutions of purified HisRS enzymes 
(2.5-0.1 uM from S. cerevisiae or A. castellanii, as indicated) in the 
presence of ATP and histidine, the reactions were digested with 
nuclease PI to release the labeled 3'-terminal A 76 nucleotide, and 
TLC was used to resolve the histidylated-AMP (product of the 
reaction) from the AMP generated by digestion of unreacted substrate. 
Lanes indicated by a minus sign (-) contain no enzyme in the 
aminoacylation reactions. 



AcaHisRS generated similar amounts of aminoacylated 
tRNA Hls from both the G_j -containing and G_j -lacking 
tRNA Hls substrates. Thus, the G_i residue is not a critical 
identity element for recognition of the tRNA by cytosolic 
AcaHisRS. 

Both AcaTLPl and AcaTLP2 exhibit tRNA repair 
of 5' -edited mt-tRNAs 

As our data do not suggest a role for either of the 
AcaTLPs in tRNA Hls maturation, we finally tested 
whether either of the AcaTLPs catalyses 5' -end repair of 
truncated tRNAs, as would be expected of enzymes that 
participate in 5'-editing of mt-tRNA, which occurs with 12 
of 15 mt-tRNAs in A. castellanii (32-34). The substrate 
used for these assays was a mitochondrially encoded 
tRNA LeuUAA from A. castellanii. This tRNA undergoes 
editing to remove A +1 and U +2 nucleotides that form 
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Figure 5. AcaTLPs both catalyse efficient repair of 5'-truncated A. castellanii mt-tRNA. Phosphatase protection assays were performed with 
mt-tRNA Leu A +2 substrate to detect repair by addition of missing G +2 and G +1 nucleotides, which generates a G +1 G +2 p*GGAU oligonucleotide, 
as indicated on the TLC plate. Reactions contained 5-fold serial dilutions of AcaTLPs (0.4mg/ml-0.6ug/ml AcaTLPl and AcaTLP2, 1.2mg/ml- 
2ug/ml AcaTLP2-mito), or 1 mg/ml of other purified TLP/Thgl enzymes, as indicated; — , no enzyme control. 



mismatches at the 5'-end of the aminoacyl acceptor stem, 
and to replace those positions with the correctly 
base-paired G+i and G +2 (33). This second reaction 
requires 3-5' polymerase activity, as recently associated 
with two TLPs encoded in D. discoideum (DdiTLP3 and 
DdiTLP4) (15). We used a tRNA (mt-tRNA Leu A +2 ) that 
mimics the presumed in vivo substrate for 5'-end repair, in 
which the two 5'-mismatched nucleotides have been 
removed and the resulting tRNA transcript begins with 
G+3 (Figure 5). For these assays, we used the 
5'- 32 P-labeled tRNA, and looked for production of a pro- 
tected oligonucleotide fragment (corresponding to 
GGp*GGAU) after incubation with TLPs followed by 
RNase A digestion and phosphatase treatment. In sharp 
contrast to the G_i addition assays described above 
(Figure 2), both of the AcaTLPs catalyse robust 
addition of the missing 5'-G nucleotides to this substrate 
(Figure 5); in fact, we observed nearly 100% conversion to 
product even at the lowest concentration of enzyme added 
in the enzyme titrations, which are the same concentra- 
tions tested in Figure 2. As previously demonstrated for 
other 5'-truncated tRNA species, SceThgl catalysed little 
detectable 5'-end repair, and other TLPs (including 
DdiTLP3, DdiTLP4 and BthTLP — a bacterial TLP from 
Bacillus thuringiensis) catalysed efficient repair, generating 
the same products as the AcaTLPs (14). Measurement of 
steady-state kinetic parameters for the repair activity 
demonstrates that AcaTLPl and AcaTLP2 both catalyse 
repair of the 5'-truncated mt-tRNA Leu with similar overall 
catalytic efficiencies (1.4 ± 0.4 x 10 4 M _1 s _1 versus 
1.2 ± 0.5 x 10 4 M _1 s _1 , respectively) (Supplementary 
Table SI). Thus, either of these enzymes is biochemically 
competent to participate in the 5'-end repair reaction that 
occurs during mt-tRNA editing in vivo in A. castellanii. 



DISCUSSION 

Repeated negative searches of extensive genomic and 
transcriptomic sequence data effectively demonstrate 
that the amoeboid protozoan A. castellanii lacks a canon- 
ical Thgl ortholog that is otherwise ubiquitous among 
previously studied eukaryotes. Although A. castellanii 
encodes genes specifying two related members of the 
Thgl superfamily (AcaTLPl and AcaTLP2), neither of 



these TLP enzymes is able to substitute for Thgl by 
catalysing the G_i addition reaction that is canonically 
required to specify tRNA Hls identity in eukaryotes. This 
conclusion is based on lack of in vitro G_i addition 
activity exhibited by purified AcaTLPl and AcaTLP2 
with cytosolic tRNA Hls substrates, as well as inability of 
these enzymes to complement the growth defect of the 
yeast thglA strain (Figure 2 and Supplementary 
Figure S3). In fact, sequencing of tRNA Hls reveals that 
the mature tRNA Hls in A. castellanii lacks the G_i 
element (Figure 3), and that the cytosolic HisRS in 
A. castellanii does not require the presence of this G_i 
nucleotide for aminoacylation of tRNA Hls (Figure 4). 
Thus, we have identified a cytosolic eukaryotic 
tRNA Hls -HisRS pair that does not rely on the canonical 
G_i identity element to maintain fidelity of translation. 
Interestingly, the two A. castellanii TLPs exhibit biochem- 
ical activities that are similar to other studied TLPs (from 
Bacteria, Archaea and D. discoideum) in that they prefer- 
entially catalyse template-dependent 3'— 5' addition and 
use this activity to efficiently repair 5'-truncated tRNAs 
(Figure 5, Supplementary Table SI). Therefore, as 
observed previously with the slime mold D. discoideum, 
there are two distinct gene products in A. castellanii that 
could potentially play a role in 5'-editing of mt-tRNA 
species in vivo. 

Although this is the first example of absence of the 
critical G_i identity element from tRNA Hls in eukaryotes, 
there has been one other demonstration of a similar phe- 
nomenon in ~20 a-proteobacterial species from the 
RC-clade (including Rhizobiales and Caulobacterales) 
(35). Normally, bacterial tRNA Hls genes contain an 
encoded G_] residue that is predictably retained from 
the precursor tRNA transcript after processing by the 
5'-maturation enzyme RNase P (9). However, in these 
a-proteobacterial species, the tRNA Hls genes do not 
contain the encoded G_i, and consequently the mature 
tRNA Hls (as in A. castellanii) lacks G_i. The ability of 
these organisms to carry out aminoacylation of the 
tRNA Hls lacking the G_i identity element is owing to 
changes both in tRNA Hls and the HisRS. However, 
there are significant differences between the bacterial 
case and the situation in A. castellanii, suggesting that 
absence of the identity element arose from independent 
events at least twice during evolution. 
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Recognition of tRNA 1S by Caulobacter crescentus 
HisRS (CcHisRS) does not require a G_i nucleotide 
and, in fact, the presence of G_i is slightly inhibitory for 
histidylation (36). Instead, CcHisRS relies heavily on the 
nucleotides of the GUG anticodon, and to a more modest 
extent on nucleotides in the aminoacyl-acceptor stem, 
particularly a G+1-U72 wobble base pair and an unusual 
bacterial A 73 discriminator nucleotide. Although 
A. castellanii tRNA Hls contains the A 73 that is typical of 
eukaryotic tRNA Hls as well as the GUG anticodon 
sequence, it does not contain the +1-72 GU wobble 
base pair, and histidylation by AcaHisRS is not inhibited 
by the presence of the G_i nucleotide (Figure 4). 
Therefore, absence of the G_i identity element is 
accommodated by distinct mechanisms in these two 
examples from separate domains of life. 

In terms of the synthetase enzymes themselves, several 
amino acid replacements and small insertions in 
oc-proteobacterial HisRS of the RC-clade (including 
CcHisRS) appear to have co-evolved with the loss of 
this identity element from the tRNA. One of the most 
important of these is the replacement of an otherwise 
highly conserved Q224 in Motif lib with G224 in the 
a-proteobacterial HisRS enzymes (35). Q224 interacts 
with the G_i-C 73 base pair that is the canonical feature 
of bacterial tRNA Hls , and thus this Q-to-G change in the 
RC-clade of a-Proteobacteria is likely to play a role in 
tRNA recognition. Again, it is unlikely that AcaHisRS 
uses a similar mechanism to achieve AG_i-tRNA Hls rec- 
ognition, as AcaHisRS shares overall sequence similarity 
with other eukaryotic HisRS enzymes and accordingly 
does not have the same conserved motif II sequence 
(including Q/G224) as a typical bacterial HisRS. Further 
investigation of the molecular basis for the non-canonical 
tRNA 1S recognition by AcaHisRS will be important and 
may lead to the ability to use this second example of an 
orthogonal HisRS/tRNA Hls pair for additional applica- 
tions, such as incorporation of histidine amino acid 
analogs into proteins in a site-selective manner. 

Interestingly, the mitochondrial HisRS in S. cerevisiae 
(which is encoded by the same gene as the cytosolic HisRS 
and uses an alternative translation start site to generate 
the mitochondrially targeted protein) does not depend on 
the presence of G_i for aminoacylation of the 
mt-tRNA Hls encoded in yeast (37). However, we note 
that the mt-tRNA Hls in 5. cerevisiae retains the encoded 
G_i residue, so despite the ability of the mt-HisRS to 
aminoacylate the tRNA that lacks G_i, the identity 
element is apparently still present in the mature tRNA 
species. Moreover, this same SceHisRS enzyme remains 
highly dependent on the presence of G_i for 
aminoacylation of cytosolic tRNA Hls , suggesting that 
although there may be unique features of mt-tRNA Hls 
that result in alternative tRNA recognition properties 
for this particular tRNA, the fundamental nature of the 
SceHisRS in terms of its G_i dependence remains as 
expected for a canonical eukaryotic HisRS, and is 
distinct from the lack of dependence on G_i observed 
with AcaHisRS in this work. Intriguingly, at least one 
other case of mt-tRNA Hls species that are only partially 
modified with G_] have been identified, suggesting that 



there may be additional complexity related to the 
presence of the identity element in organelles, although 
again, in these cases, the HisRS enzyme depends on the 
presence of G_i, as expected, and the role of the AG_|- 
tRNA Hls in the mitochondria remains unknown (38). 

The aberrant nature of tRNA Hls in A. castellanii and the 
resulting effects on tRNA Hls identity described here have 
implications for other eukaryotic species. Although a 
bona fide Thgl ortholog is identifiable in the majority of 
eukaryotes for which genomic sequences are available, 
there are other species that similarly lack an evident 
Thgl gene (17). Some of these species, like A. castellanii, 
may obviate the need for Thgl -catalysed G_i addition by 
use of a AG_ r type tRNA Hls . Identification of other or- 
ganisms for which this is the case might provide clues to 
the evolutionary pathway of this unusual tRNA identity 
feature. It is also possible that in some Thgl-deficient eu- 
karyotes that encode one or more TLPs, one that catalyses 
the cytosolic G_ t addition activity may yet be identified, 
thereby allowing retention of the canonical identity 
element. Identification of TLPs of this type may again 
provide important data to help understand the evolution 
of the non-templated G_i addition activity that is a 
unique feature of eukaryotic Thgl enzymes. In this 
regard, it will be interesting to investigate organisms 
(such as the fungus P. nodorum) that encode TLPs that 
do not contain mitochondrial targeting signals and whose 
mt-tRNAs (as inferred from secondary structure 
modeling) do not require 5'-editing (17). Finally, there 
may be alternative non-Thgl -dependent pathways that 
remain to be identified whereby the G_ r residue on 
tRNA Hls might be acquired. The results described here, 
however, clearly challenge the previously accepted 
paradigm that tRNA Hls identity and the presence of a 
G_j residue are inextricably linked in eukaryotes; as 
such, these observations represent an important step 
toward understanding the multiple pathways to tRNA 1S 
identity in biology. 

The question of the precise biological role of the two 
encoded TLPs in A. castellanii remains to be addressed 
completely. Biochemically, as with other TLPs studied 
previously, both AcaTLPs possess efficient 3—5' polymer- 
ase activity when acting on truncated tRNAs, suggestive 
of a role in tRNA repair (Supplementary Table SI, 
Figure 1). As tRNA 5'-repair is a required activity for 
5'-editing of mt-tRNAs, one or both of these enzymes 
may participate in this process, which has now been 
either predicted or demonstrated to occur in the 
mitochondria of a number of other unicellular eukaryotes 
in addition to A. castellanii (26,39,40). The predicted mito- 
chondrial localization of AcaTLP2 suggests that it is likely 
to localize to the correct compartment to participate in 
this activity, but the actual subcellular location of TLPs 
has not been demonstrated in any organism. It is 
intriguing that A. castellanii, like D. discoideum, has two 
separate genes whose protein products possess the bio- 
chemical functionality to participate in the editing 
process, only one of which contains a predicted mitochon- 
drial targeting signal. Whether in protozoa these enzymes 
form heteromeric 5'-editing complexes during editing, or 
whether the two enzymes function in an altogether 
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different cellular compartment (e.g. mitochondria and 
cytosol) remains an interesting question that is currently 
being addressed. 
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